Performance Analysis on Uncertain Data using Decision Tree
نویسندگان
چکیده
Data uncertainty is common in emerging applications, such as sensor networks, moving object databases, medical and biological fields. Data uncertainty can be caused by various factors including measurements precision limitation. Data uncertainty is inherited in various applications due to different reasons such as outdated sources or imprecise measurement and transmission problems. Classification is one of the most popular data mining techniques. Lot of people used decision tree for data classification and it widely used on certain or precise data. However in this paper we applied on uncertain data which is taken from UCI machine learning repository. This paper proposes a decision tree based classification method on uncertain data. We construct decision tree algorithms by including entropy and information gain, considering the uncertain data intervals. We use some pruning techniques that can improve efficiency of the decision tree and our experiment show that it significantly reduce the tree-construction time.
منابع مشابه
Classification of Categorical Uncertain Data Using Decision Tree
Certain data is a data whose values are known precisely whereas uncertain data means whose value are not known precisely. But data is always uncertain in real life applications. In data uncertainty attribute value is represented by a set of values. There are two types of attributes in data sets namely, numerical and categorical attributes. Data uncertainty can arise in both numerical and catego...
متن کاملPerformance Analysis on Uncertain Data using Decision Tree
Data uncertainty is common in emerging applications, such as sensor networks, moving object databases, medical and biological fields. Data uncertainty can be caused by various factors including measurements precision limitation. Data uncertainty is inherited in various applications due to different reasons such as outdated sources or imprecise measurement and transmission problems. Classificati...
متن کاملAn Integrated DEA and Data Mining Approach for Performance Assessment
This paper presents a data envelopment analysis (DEA) model combined with Bootstrapping to assess performance of one of the Data mining Algorithms. We applied a two-step process for performance productivity analysis of insurance branches within a case study. First, using a DEA model, the study analyzes the productivity of eighteen decision-making units (DMUs). Using a Malmquist index, DEA deter...
متن کاملPresenting a Model to Assess Organizational Performance Based on the Concept of Knowledge Management Using Regression Model, Decision Tree, Gray Relational Analysis and DEMATEL Method (Case Study: National Library and Archives of Iran)
Many organizations have recognized that knowledge is the most important resource in today’s economy. With regards to knowledge-based views of the firm, organizations are actively embracing knowledge management with the expectation of acquiring and maintaining high levels of organizational performance. The relationship between knowledge management (KM) and organizational performance has been the...
متن کاملارزیابی عملکرد واحدهای تصمیمگیرنده با استفاده از تحلیل پوششی دادههای پنجرهای و درخت تصمیم
Efficiency is an issue of importance and interest to both managers of different organizations and customers who use the products and services of these organizations. The aim of this research is to study the efficiency of pharmaceutical companies accepted in the Stock Exchange Organization by using Window Data Envelopment Analysis (WDEA) and then, to provide some rules based on the decision tree...
متن کامل